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^ ! 

, Abstract 

^ ' We prove conditional negative association for random variables Xj = l{|CT-i(j)|>tj.} (j G [n] 

{!,..., n}), where cr(l), . . . , a{m) are i.i.d. from [n]. (The cr(«)'s are thought of as the locations 
of balls dropped independently into urns 1, . . . , n according to some common distribution, so 



in 



that, for some threshold tj, Xj is the indicator of the event that at least tj balls land in urn 



j.) We mostly deal with the more general situation in which the (T(i)'s need not be identically 
I distributed, proving results which imply conditional negative association in the i.i.d. case. Some 

■ of the results — particularly Lemma 8 on graph orientations — are thought to be of independent 

interest. 

We also give a counterexample to a negative correlation conjecture of D. Welsh, a strong 
version of a (still open) conjecture of G. Farr. 



1 Introduction 

> , 

. Competing urns refers to the experiment in wliich m balls are dropped, randomly and independently, 

\^ \ into urns 1, . . . , n. Formally, we have a random a : [m] [n] (where [m] = {!,..., m}) with the 

' cr(i)'s independent. We then take xj to be the indicator for occupation of urn j and are interested 

in the law, /i, of (xi, . . . ,x„) (a measure on {0, 1}"). In the traditional case where the balls are 
I identical (i.e. the a{i)'s are i.i.d.) we call /i an urn measure, or, for emphasis, an ordinary urn 

^ ' measure. More generally, setting Bj = |o'^^(j)[, we may consider thresholds ii, . . . and let Xj 

be the indicator of {Bj > tj}; for i.i.d. balls, we then call the law of (xi, . . . ,x„) a threshold urn 
measure. When the balls are not required to be identical we speak of generalized urn measures and 
^ ' generalized threshold urn measures. 

■ We are interested in correlation properties of these measures, but before proceeding further 

need to briefly recall a few definitions. A fuller version of the following discussion is given in [10], 
and further background and motivation may be found e.g. in [15]. 

Recall that events ^, ;B in a probability space are positively correlated — we write A t B — if 
Pt{AB) > Pr(^) Pr(0), and negatively correlated {A B) if the reverse inequality holds. 

We will be interested in measures on finite product spaces = HiLi with each rij a chain 
(totally ordered set), often simply {0, 1}. We use ^A{Q,), or simply M, for the set of probability mea- 
sures on 0, and Ms for M.{{0, 1}"^). We will occasionally identify {0, 1}"^ with 2'^ ({subsets of S} 
ordered by inclusion) in the usual way. 

Recall that an event ^ C is increasing (really, nondecreasing) x > y £ A =^ x £ A 
(where f2 is endowed with the product order) , and similarly for decreasing. For real- valued random 
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variables X, Y, write X if 



{X>s}i{Y> t} Vs, t G % 



(1) 



or, equivalently, if 



Ef{X)g{Y) < Ef{X)Eg{Y) for all increasing f,g:^^^. 



(2) 



(N.B. this differs from the usage in [15]. Of course X Y means the reverse inequalities hold, but 
we don't need this.) 

Say i G [n] affects .4 C J7 if there are ij E A and r G \ .4 with rjj = tj "ij ^ i, and write 
„4 -L jB if no coordinate affects both A and B. Then fi £ A4 is negatively associated (or has negative 
association; we use "NA" for cither) ii A X ^ whenever A, B are increasing and A -L B. We say fi 
has negative correlations (or is NC) if rji \. rjj (where r] is the random string) whenever i ^ j. 

We are primarily concerned with conditional negative association: fj, E Ai is conditionally 
negatively associated (CNA) if any measure obtained from /j, by conditioning on the values of some 
of the variables is NA. (Throughout the paper we assume that any conditioning event we consider 
has positive probability.) Conditional negative correlation (CNC) for is defined similarly. 

When n = {0, 1}", stronger properties are obtained by demanding NC (resp. NA) for every 
measure W o /j. e M of the form 



with W = {Wi, . . . , Wn) G . (Borrowing Ising terminology, one says that W o ^is obtained from 
IX by imposing an external field.) Then n is said to be Rayleigh or NC+ (resp. NA+), the reference 
in the former case being to Rayleigh's monotonicity law for electric networks (see e.g. [4] or [3]). 

The competing urns model was explored in some detail by Dubhashi and Ranjan [5]*, who proved 
inter alia that threshold urn measures are NA. Another proof of this is given in [15]. Actually the 
argument of [5], which proves the stronger statement that the (law of the) r.v.'s 



is NA, does not require identical balls. (The argument of [15] does not work for nonidentical balls.) 
The main purpose of the present note is to prove 

Theorem 1 Threshold urn measures are CNA. 

In contrast, as observed in [10], even ordinary urn measures need not be Rayleigh (but see the 
remark on R'^ in Section 5). We don't know whether Theorem 1 extends to nonidentical balls 
(again see Section 5). 

Let us quickly say what Theorem 1 has to do with [10]. Following [15], we say that /i G 
7W({0, 1}") is ultra-log-concave (ULC) if its rank sequence, {r^ := iJ,{\rj\ = ■i)}"=o (where \rj\ = X^r/j), 
has no internal zeros and the sequence {rj/(^)}-Lg is log-concave. A set of four conjectures from 
[15] (see his Conjecture 4) states that each of CNC, CNA, NC+ and NA-|- implies ULC; but, as 
shown in [1] and [10], even the weakest of these (NA+ ULC) is false. Theorem 1 provides a 
more natural counterexample to the stronger "CNA ULC," since, as observed in [10] (disproving 
another conjecture from [15]), urn measures need not be ULC. 

*They say "bins" rather than "urns." 



^ij — l{(T(i)=j} 



(3) 
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The proof of Theorem 1 gives something a httle more general, as follows. Suppose that for each 
j G [n] we are given a sequence = ao(j) < • • • < ak.{j) = m + 1, and for a : [m] — )■ [n] set 

xj(a) = t iff at{j) < B, < at+i{j). (4) 

Theorem 2 // the a{i) 's are i.i.d. then the Xj 's in (4) are CNA. 

Call the law of (xi, . . . , x„) as in (4) a (generalized) interval urn measure. 

The paper is organized as follows. Section 2 reduces Theorem 2 to either of our two main 
inequalities, (10) and (14). Each of these is valid at the level of generalized interval urns; they arc 
equivalent in the case of ordinary urns but not obviously so in general (though the argument in 
Section 3 uses some interplay between the two). It is only in the derivation of Theorem 2 from (10) 
that we need the o"(i)'s to be i.i.d. 

We give two quite different ways of getting at these main inequalities. Theorem 4 in Section 
3 essentially restates (10) and (14) in induction- friendly form; the proof of the theorem given in 
this section is inspired by [5]. Section 4 takes a different approach, based on a graph-theoretic 
observation, Lemma 8, that is thought to be of independent interest. The lemma is used to: 
reprove (10); in combination with a result from [11] (Theorem 11 below), to prove a stronger, 
ultra-log-concavity version of (14); and to prove "log-submodularity" for some classes of measures. 

Finally, Section 5 contains some discussion of the question of whether Theorem 1 extends to 
nonidentical balls, mentions a conjecture of G. Farr and a stronger one of D. Welsh, and sketches 
a counterexample to the latter. 

Some notation. For a nonnegative vector 

J = {jij :i e[m],j e[n]), (5) 

A C [m] and K C. [n], the probability measure on (functions from A to K) corresponding to 7 
is that given by 

Pr(c7)cxT^(a):=n7i,aW- (6) 

Thus the r.v.'s a{i) are independent; they are i.i.d. if 7ij does not depend on i, in which case 
we write simply 7^. We also use Pr^ {L C [m]) for the measure on [n]^ corresponding to 7 (so 
Pr = PrH). 

2 Setting up 

Let the law of cr G [n] be given by (6) and let xi , . . . , x„ be as in (4) . Let 7 U J U K be a partition 
of [n] and tj G {0, . . . , fcj — 1} for j G if, and set 

Q:={^{a) = tonK} (= {at,(j) < k-^i)! < Vj G i^}), (7) 

X = |c7~^(/)| and Y = |(T~^(J)|. The main point for the proof of Theorem 2 is 

XiY given Q, (8) 

given which we finish easily: 

Proof of Theorem 2. With notation as above, let A,B Q he increasing events determined by 
I and J (more precisely, by the values of the variables Xj indexed by I and J) respectively. For 
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Theorem 2 we should show A i B given Q. Define /, 5 : N ^ 3? (where N = {0,1,...}) by 
f{k) = Pr(^|X = k), g{l) = Vi{B\Y = I). A standard couphng argument shows that / and g are 
increasing, whence, according to (8), 

Pr(^n-BIQ) = £[f{X)g{Y)\Q] < £[f {X)\Q]£[g{Y)\Q] = Pr(^|Q) Pr(^|Q) 

(where the first equaUty follows from conditional independence of A and B given {X,Y)). 

I 

We continue to condition on Q and write fik for the law of Y given {X = k}; that is, 

fikil) = Pr(y = l\X = k). (9) 

We will actually prove 

(whenever neither side is 0/0, where we agree that x/0 = 00 for x > 0), which is a strengthening 
of (8) once we rule out some pathologies. We recall the standard 

Definition 3 C C N" is convex if a,c C and a <b < c imply b & C. 

It will follow from Proposition 6 below that 

supp(Pr) := {{k, I) : Pr(X = k,Y = I) > 0} is convex. (11) 

Given this, (10) implies that Y is stochastically decreasing in X — that is, 

f^k+iiY>t) <fik{Y>t) yk,t 

(the easy implication is essentially Proposition 1.2 of [15]) — which in turn easily implies X | F. 
Let 

Z =\a-^{IUJ)\. (12) 

When the (7(i)'s are i.i.d., an alternate way to specify X and Y is: let Z be as in (12), X ~ Bm{Z, a) 
and Y = Z — X, where a = 7//7/UJ (with 7/ = J2i£i n) and Bin(Z, a) is the binomial distribution 
with parameters Z and a. 

In general, for v the law of an N-valued r.v. Z and a G [0,1], let X = Xy^a ~ Bin(Z, a), 
Y = Yy^a = Z — X and, for lack of a better name, say v is binomially negatively associated (BNA) 
ii X ],Y for every a. Call a nonnegative sequence a = (ai)^Q strongly log-concave (SLC) if 

ia| > {i + l)ai_iai+i Vz > 1 (13) 

(that is, (i!ai)^Q is log-concave), and say u G A1(N) is SLC if the sequence (z^(i))^Q is. A 
straightforward calculation shows that this is equivalent to saying that (10) holds for any a, X = 
Xy^a and Y = Yy^a (and Hk as in (9)): since 

_ u{k + l)Vi{X = k\Z = k + I) _ u{k + l){^f)a^{l-ay 
Pr(X = fc) " FT{X = k) 

we may rewrite (10) as 

v{k + l + 2){%f).{k + <u{k + l + l){'tifHk + I + l)(^+^+^ 
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which is SLC for v. (If v is Poisson — that is, if (13) holds with equahty — then X and Y are 

independent Poisson r.v.'s and the incquahties (1) are equalities.) Thus, in the i.i.d. case, (10) is 
equivalent to saying that Z as in (12) is SLC. The latter again turns out to be true at the level of 
generalized urns; that is, for any 7 as in (5), a G [n]["^^ with law given by (6), Q as in (7) and Z as 
in (12), 

the law of Z is SLC. (14) 

It's also easy to see that absence of internal zeros in {^{i)) is equivalent to (11) for X = X^^a, 
Y = Yv,a (which, again, is given by Proposition 6), so that (14) again implies (8) (and Theorem 2). 
It seems interesting that both (10) and (14) are valid for generalized urns, though the equivalence 
that holds for i.i.d. balls disappears in the more general setting. 

As mentioned earlier, in Section 4 we will combine Lemma 8 with a result from [11] to obtain 
an improvement of (14): 

the law of Z is ultra-log-concave. (15) 

(Recall — see following Theorem 1 — this means that the sequence {Pr(|Z| = i)/{^)}^Q is log- 
concave without internal zeros.) 



3 First proof 

Let Pr be the measure on [n]'"*] corresponding to some 7 (see the end of Section 1), and for 
a, 6 G N"-^ and fc G N, set 

p{k, a, h) = Pr(5„ = k\Bj G [aj,hj\ Vj G [n - 1]) (16) 

(recalling that Bj = \o-~^{j)\). 

Theorem 4 With notation as above, 

p{k + l,a,b) . . . . . ,s , 

(a) — — — IS nomncreasmg m [a,0), ana 

p(k, a, 0) 

p{k + l,a,6) ^ k p{k,a,b) 
^ ' p{k,a,b) - k-\-l' p{k-l,a,b)' 

where we say nothing about the case 0/0 and agree that x/0 = 00 when x > 0. 

See also Theorem 14 in Section 5 for a related result. 

As noted earlier, part (b) of Theorem 4 is just a reformulation of (14), while (a) is a mild 

generalization of (10) (which in fact — see (19) — quickly reduces to (10)). To see this, note that in 
(10) we may assume that each of /, J is a singleton, say K = [n — 2], / = {n — 1}, J = {n} — formally 
we could pass to 

' 7ii iijeK = [n-2] 

T,{lij - 3^1} if J = n - 1 
E{7ii : J e J} if j = n 

— and similarly in (14) we may assume K = [n — 1], I = {n} and J = 0. Then Theorem 4(b), 
which may also be stated 

for fixed a,b E N'"'"^] the sequence {p{k, a, b)} is SLC, 
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is (up to some name changes) the same as (14), while (10) is equivalent to 



p{k + 1, a, b)/p{k, a, b) > p{k + 1, a', b')/p{k, a', b'), 

where a„_i = bn-i = a'n-i = ^n-i = ^ + 1 (for some t) and aj = a'p bj = b'j for j E [n — 2]. 

On the other hand, the inductive proof of Theorem 4 employs both the more general form of 
(a) and some interplay between the two parts. 

Before proving the theorem we note one further consequence and give the promised Proposition 
6. For f,a e N", let Mf{a) = {a e [n][™] : \a-^{j)\ € [aj,aj + fj] Vj G [n]} and M/(a) = 
PT{Mf{a)). Though we won't use the next result (but see the remark following Corollary 9), it 
seems natural and worth mentioning. 

Corollary 5 For each f G N", M = Mf satisfies the negative lattice condition: 

M{a)M{c) > M{aW c)M{aAc) Va,cGN". (17) 

This is more or less immediate from Theorem 4 once we have the next little observation, which, as 
noted earlier, also gives (11) and absence of internal zeros in the law of Z in (12). 

Proposition 6 For any f and Mf as above, the support of M = Mf is convex. 

Proof. This will follow easily from 

Claim. For any cr, r G [n]["*l with Pr((7),Pr(r) > and i G [n] with |(T~-^(z)| > |r~^(z)|, there are 
j G [n] and p G [n]H with Pt{p) > 0, < |T~^(j)| and 

\p-Hk)\ = < 



cT-i(i)|-l if k = i 
a-\j)\ + l ifk = j 
a-\k)\ iike[n]\{i,j}. 



This is a standard type of graph-theoretic observation: regarding a and r as edge sets of bipartite 

graphs on [m] U [n] in the natural way,^ wc need a path with edges alternately from cr \ r and r \ o" 
that begins with a a-edge at i and ends with a r-edge at some j as above. (We then get p by 
switching a and r on this path.) We omit the routine proof that such a path must exist. 

To prove Proposition 6, we should show that for all distinct a,b,c G N" with a < b < c and 
a, c G supp(Af). wc also have b G supp(M). Of course it suffices to show this when there is some 
i G [n] with 6^ = q — 1 and b^ = for all k ^ i. Choose t € A4{a) := ^Af{a) and a & Ai{c) with 
Pr(T),Pr(cj) > 0. We assume |(T~^(i)| = Cj + /j, since otherwise a G M(b) and we are finished. 
Letting j,p be as in the claim (note |t~^(z)| < q + /,), we have 

\p-\i)\ = bi + fi 

and 

\P~Hj)\ = + 1 e [cj + 1, aj + fj] C [bj, bj + fj], 

whence p e M{b) and b G supp(M). 



■''We pretend [m] n [n] = 0. 
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Proof of Corollary 5. It is easy to see (and standard) that convexity of M (given by Proposition 6) 
implies that it's enough to prove (17) when there are indices i and j with ai = Ci — 1, aj = Cj + 1, 
and Ofe = Cfe for all k ^ In this case — assuming, w.l.o.g., that i = n — 1 and j = n — we set 



and 



Then (17) is 



p^{k) = Pr(B„ = k\Bi e [ai, ai + fi] VZ G [n - 1]) 
P2{k) = Pr(B„ = k\Bi G [ci,ci + fi] VZ G [n - 1]). 



'C„+f„ + l \ /c„+f„ \ /cn + fn + 1 \ /Cn+fn 

E ^'i(^) E P2(A;) > E p^ik)\ E p^i'') 

. k=C„+l I \ k=Cn I \ fe=C„ + l / V fe=Cn 



and follows immediately from 

Pi{k)p2{l) > Pi{l)P2ik) whenever k > I, 
which is a consequence of Theorem 4(a) (and Proposition 6). 



We now assume (as we may) that J^j Hj = 1 for each i. The proof of Theorem 4 resembles that 
of Theorem 33 in [5] , and is based on 

Observation 7 For any i G [n], fc G N, and event Q determined by ■ j 7^ i), 

Pr(B, = A; + 1, Q) = ^ E 7a PrH\{0(B, = k, Q). 
" + ^ ie[m] 

(Recall Pr^ was defined at the end of Section 1.) We also use the trivial 

. ai . ai-\ h Ofe ai 

mm — < — — < max — (18) 

i Pi - Pl + --- + Pk ~ i Pi 

(for all ai, . . . , Uk, /3i, . . . , /3fc > with /3i + • • • + /3fc > 0, where, again, x/0 := 00 when x > 0). 

Proof of Theorem 4. We proceed by induction on m, omitting the easy base cases with m = 1. 
For (a), it's enough to show that the ratio in question does not increase when we increase a single 
entry — w.l.o.g. the (n — l)st — of one of a, b. Thus, by (18), it suffices to show that 

PT{Bn = k + l\Bn-l=t,R) . ■ ■ ^ 

— — — TT^r — — IS nonmcreasmq m t, 

Pr{Bn = k\Bn-i = t,R) 

where R = {uj < Bj < bj Vj G [n — 2]}; and by Proposition 6, this will follow if we show 

FriBn = k + l\Bn-i = t + l,R) ^ PrjBn = k + l\Bn-i = t, R) 

PT{Bn = k\Bn-l=t+l,R) - Pl{Bn = k\Bn-l=t,R) ^ ' 

for all t for which the probabilities appearing in (19) are positive. (This is the easy reduction of 
(a) to (10) mentioned earlier.) 
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By Observation 7 we may write the left side of (19) as 

EfgH ^'."'1 Pr["]\^'>(^n = A; + 1, Bn-i = t, R) 
Elelm] %n-i PrH\{'}(i?„ = k, Bn-i = t,R) ' 

which, by (18), is at most 

PrH\W(B„ = fc + =t,i?) 
/gM PrH\{0 = k\Bn^i = t, R) 

Thus, setting Q = {Bn-i = t} AR and assuming (w.l.o.g.) that the maximum occurs at I = m, we 
will have (19) if we show 

Pr(ij^ = fc + 1|Q) ^ Pr^"'-^(i?^ = fc + l|Q) 
Fr{Bn = k\Q) - Prt— il(B„ = A;|Q) ' ^ ^ 

Now 

Pr(5„ = k+l\Q) _ E,g[„] Pr(a(m) = i|Q) Pr(5„ = k + l\Q, a{m) = j) 



FT{Bn = k\Q) j:^^^^^^FT{a{m)=j\Q)Pv{Bn = k\Q,aim)=j) ' 

so that (20) will follow (again using (18)) from 

Fr{Br. = k+l\Q,a{m)=j) Prl-^ (i3„ = + 1|Q) 
Pr(5. = fc|Q,a(m)=,) - Pr[™-i](i?„ = fe|Q) ' ^ ^''^ 

(where, again, "for all j G [n]" really includes only those for which Pr((5, cr(m) = j) > 0). 
There are three cases to consider. If j = n, the left side of (21) is 

Pr['"~^l(B„ = k\Q) 
Pr['"-il(B„ = fe-l|Q)' 

which is at least the right side of (21) by (part (b) of) our induction hypothesis. If j = n — 1, the 
left side of (21) is 

Pr["'-"^-(I?„ = A- + l\Dn-i = i - l,R) 
Pr["^-il(5„ = A;|S„_i = t-l,i?) ' 

which is at least the right side of (21) by (part (a) of) the induction hypothesis. Finally, if j ^ 
n — l,n, the left side of (21) is 

Pv^"'-^\Bn = k + l\Bn-i = t,R*) 

Fr^"'-^\Bn = k\Bn-i = t,R*) ' 

where R* is obtained from R by replacing the condition aj < Bj < bj by the condition aj — 1 < 
Bj < bj — 1; again this is at least the right side of (21) by part (a) of the induction hypothesis. 

We now turn to (b) and set Q = {aj < Bj < bj Vj G [n — 1]}. Then we have, again using 
Observation 7 and (18), 

Pr(i?^ = fc + 1[Q) ^ kY:ielm]7inP^^"'^''^'HB^ = k,Q) 

Pr(S„ = k\Q) [k + 1) E;gM lin PrM\{0(i3„ = A; - 1, Q) 

^PrM\{0(5^^ = k,Q) 

< max . 

ieM (fc + 1) PrH\W(5^ = )t - 1, Q) 

w.Lo.g. fcPr["^-^l(^n = A:,Q) 

(A; + l)Pr["*-il(5„ = A;-l,Q) 
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(noting that we may assume, by Proposition 6, that Pr(B„ = r\Q) is positive for r G {k—1, k, k+1}); 
so we will be done if we can show 



Pr["*-il(5„ = k- 1\Q) - Pr{Bn = k- l\Qy 
Proceeding as in the proof of part (a) , we may rewrite 

Pr(S„ = k\Q) Eje[n] Pr(a(m) = j\Q) Pr(5„ = k\Q, a{m) = j) 



Pr(S„ = k-l\Q) E,eM Pr(fT(m) = j\Q) Pr(5„ = k- 1\Q, aim) 
so for (22) it is enough to show that, for each j € [n], 

PT{Bn = k\Q,a{m)=j) ^ PrM(5„ = fc|Q) 



Pr{Bn = k-l\Q,a{m)=j) ' Pr['"-il(B„ = A; - 1|Q) ' 

which, as did (21), follows easily from our induction hypothesis (here we only need to consider the 
two cases j = n and j ^ n). 



4 A graphical approach 

We begin here with a natural and seemingly new graph theoretic statement which we regard as 
the main point of this section. Given a multigraph G on vertex set V and a, 6 G N^, let 0{a, b) = 
Oaia, b) be the set of orientations of G for which 

d'^{x) > Qx and d~{x) > bx for all a; G F 

and N{a, b) = Nc^a, b) = \0{a, b)\. Here d'^ and d^ are, as usual, out- and in-degrees. We will also 
use dx for the degree of x in G. Note we regard a loop (at x, say) as having two orientations, each 
of which contributes 1 to each of d'^ (x) and d" (x) . 

Lemma 8 // a, b,r,s E satisfy 

a > r,s and a + b > r + s 

(where the inequalities are with respect to the product order on N^j, then 

N{a,b) < N{r,s). (23) 

Of course the idea is that it's harder to satisfy a set of demands that always requires large out- 
degrees than one for which these requirements are mixed. For the sake of comparison, let us also 
mention the specialization of Corollary 5 to the present situation: 

Corollary 9 Ifa + b = r + s then 

N{a, b)N{r, s) > N{a V r, 6 A s)N{a A r, 6 V s). (24) 

Proof. Interpret vertices of G as urns and edges as balls, and assume that for each edge (ball) e 
we have ^ex = 1 or according to whether x is or is not an end of e. Then (24) is just Corollary 5 
with f = d — a — b{=d — r — s), where d = {dx ■ x G V{G)) is the vector of degrees. 
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Remark. It's possible to simplify the proof of Lemma 8 using Corollary 9; but of course this depends 
on Theorem 4, so is really harder than the following direct proof. On the other hand, it's not too 
hard to derive Theorem 4(a) from Lemma 8; see [14]. (And below we use Lemma 8 to prove (15), 
which is stronger than Theorem 4(b).) 

Proof of Lemma 8. We proceed by induction on if{G, a, b) := \E{G) \ + J^xevi^x — o,x — bx), calling 
X G V saturated if Ux + bx = dx- Since N is nonincreasing in each of its arguments, we may assume 
a -\- b = r + s (or we can increase r or s) . 

Suppose first that there is at least one saturated vertex, x. We may assume there are no loops 
at X, since otherwise (23) follows easily from the induction hypothesis applied to the graph gotten 
from G by deleting such loops. Let a = Qx, P = bx, p = rx,o' = Sx, and let X = {ei, . . . , Ca^^} be 
the set of edges incident with x. 

Consider a set tt consisting of /3 pairs {ej, ej} C X, with the 2/3 edges appearing in tt distinct, 
and, say, yi the vertex joined to x by Cj (so the y^'s need not be distinct). Let G{'k) be the graph 
with vertex set V \ {x} and edge set E{G) \ X U {eij : {cj, ej} G tt}, where Cij joins yi and yj. Let 
U{tt) be the set of edges in X not belonging to pairs from tt, and Uziir) the set of edges of U{Tr) 
incident to z. 

Define a"", G N^(^('^)) by 

al = and 6J = max{6^ - \Uz{'k)\,Q] for &\lz&V\ {x] (= 1/(G(7r))). 
For each tt as above and T e {^^^J"^) (where {^) = {B C A : \B\ = k}), define r'^.^,s'^.^ e N^(0('^)) 

by 

r,-'^ = max{r,-|C/,(7r)\T|,0} 

and 

sf'^ = max{s^ - |[/^(7r) nT|,0} 

for all z eV\ {x}. 

Each a G OG(Tr){a^, b'^) maps naturally to a (unique) a G Oaia, b), namely: a agrees with a on 
E{G — x); orients all edges of U{tt) away from x; and orients from y^ to x and ej from x to yj 
whenever a orients Cjj from yi to yj (where, when yi = yj, we interpret one orientation of the loop 
Cij as yi — yj and the other as yj — >■ y^). Since each r G Ocia, b) is in the range of this map for 
exactly (^)/3! choice of tt, we have 

N{a,b) = ^Y.^Gi.){a\b-^). (25) 

Similarly, 

Since ^-h" > r"^"^ + s^'^ and > r"^"^ , s'^''^, it follows from the induction hypothesis that 

iVG(.)(a^6") < iVGw(r'^'^,5-'^) for all tt and T G (^W). (27) 
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(Note that <^(G(7r), a^, 6^) < ip{G,a,b), since \E{G{tt))\ < \E{G)\ and, for z G V\{x}, + > 
dz + bz — \Uz{Tr)\, while the degree of z in G(7r) is dz — \Uz{Tr)\.) Combining (25), (26) and (27), we 
have 

^(-'-) > (Py^E = ^ma,b) > N{a,b), 

where the last inequality follows from the assumptions a> p,a and a + (5 = p + a. 

So we may assume there are no saturated vertices. In this case we fix x G F with Qx > bx- (Of 
course if there is no such vertex, then a = b = r = s and (23) is an equality.) For 7, (5 G N let 
N'l'f, S) be the number of orientations of G with 

^ ^' (1,5) ify = x, 

and let AT" (7, S) be defined analogously with (r, s) in place of (a, b). Let a = Ux, P = bx, p = Vx, o = 
Sx, SO that (23) is 

N'{a,P)<N"{p,a). (28) 

By induction we have 

AT' (7, 6) < N"{ri, whenever 7>77,^and7 + (5 = 77 + ^>a + ^. (29) 
We apply this to the identity 

AT' (a, p) = N'{a, P + + N'{dx (30) 
If a > a, then, by (29), the right side of (30) is at most 

N"{p, a + l) + N"{dx -a,a) = N"{p, a). 
If a > p, then, again using (29), the right side of (30) is at most 

N"{p + 1, a) + N"{p, dx-p) = N"{p, a). 
(And, since a > P, we have at least one of a > cr, a > p.) 



The next result isolates (and generalizes) the main point in the derivation of (15) from Lemma 
8. Wc consider a hypergraph H = HiU 7(2 on a set W of size 21, where 

(i) the edges of Hi are pairwise disjoint and 

(ii) the edges of ^2 are of size 2 and pairwise disjoint. 

Let S be the set of vertices of T-L not covered by edges of 7^2, and 15"! = 2t. Given a : Tii N, 
let Ni be the number of partitions {X,Y) of W with each of X, y a vertex cover of 712, each of 
\X n H\, \Y n if I at least for each H eHi, and \X\ = i. 

Lemma 10 In the above situation, Ni > ^AT^+i. 
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Proof. For z G N and tt a collection of i — 1 disjoint 2-sets contained in S, let A/i(7r) be the set of 
partitions as above for which each of X, Y also covers the edges of tt, and set Ni = |Ai|. We assert 
that (for each tt) 

Ni{7t) > 2Ni+,{7r). (31) 
This implies the proposition since (as is easily seen) 



t-t\ ^ 

TV 



and 



For the proof of (31) let x, y be the two vertices of W not contained in members of •= ^2 Utt. 
Noting that {X,Y) G A/i+i('7r) implies {x,y} € X, we may regard {X,Y) as an orientation of ?^2i 
where orienting {u,v} from u to v corresponds to putting u in X (and f in y). The orientations 
corresponding to {X, F)'s from A/j+i are those for which, for each H eHi, 

d+{H)>a„-\Hn{x,y}\ and d-{H)>ajj, 

where, for the given orientation, d'^{H) (resp. d~{H)) is the number of oriented edges whose tails 
(resp. heads) lie in H. 

If we let G be the multigraph gotten from {H, tt) by collapsing each G Hi to a single vertex 
(so for example, any {u, v} G TL2 contained in some H G T-Li becomes a loop in G), then the above 
discussion says that A^/+i(7r) = Noia, h) (see Lemma 8 for the notation), where 

o-z = Oijj — \Hr\ {x, y}| and hz = 

if z is the vertex of G corresponding to G Hi, and = 6^ = if z is not of this type (i.e. 
z eW\U{H : H G rii}). 

A similar discussion shows that A^;(7r) = Ncir, s) + Ng{s, r), where 

rz = aH - ^{xeH} and = - l{yeH} 

if z is the vertex of G corresponding to H ^ T-Li, and = = if z is not of this type. (For 
example, NG{r,s) counts pairs {X,Y) with x G X (and y G Y).) 

Finally, Lemma 8 gives Ndr, s), NG{s,r) > NG(a,b), so we have (31). (Strictly speaking we 
may be applying Lemma 8 with some negative entries in b, r and/or s; but it's easy to see that this 
slightly more general version follows from the lemma as stated.) 



As mentioned earlier, the proof of (15) also requires Theorem 11 below. (If we just wanted (14) 
then Lemma 10 alone would suffice.) For /x G Mm '■= M{2^"^^), set 

«^(/^) = E{/^(^)m(^) ■■AC[m], \A\ = i} (32) 

(where A = [m] \A). Say fi G A42k has the antipodal pairs property (APP) if akin) > Q!fc-i(/i), and 
/J, G Mm lias the conditional antipodal pairs property (CAPP) if every measure obtained from fj, 
by conditioning on the values of some m — 2k variables (for some k) has the APP (where we view 
conditioning on the values indexed by T as producing a measure in A4(2^'^^\'^). 
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Theorem 11 ([11]) A measure with the CAPP and no internal zeros in its rank sequence is ULC. 

Proof of (15). It's again enough to show this when K = [n — I], I = {n} and J = 0. Setting 
U = a^^{K) and letting fi be the law of U, we prove the equivalent 

H is ULC. (33) 

For A C [m] and a G K^, say cr G Q if it satisfies the conditions in (7), which we now rewrite 

Sj<\cJ-\j)\<Tj yj€K, (34) 

where Sj = at-{i) and Tj = at-+i{j) — 1. (This extends the Q of (7), which was a subset of [n]["*l.) 
Write a ~ A if (J G fl Q and a ~ Hf a ~ ^ for some A of size I, and set T{A) = n{7m '■ i G 
[m] \ A}. Then jj, is given by 

li{A) oc T{A) ^{M/^(a) : a ~ ^} (AC [m]). 

By Theorem 11 we wih have (33) if we show that /i satisfies the CAPP and its rank sequence 
has no internal zeros. The latter condition is given by Proposition 6, applied with 

•'^ \ if j = n 

(so that n{\U\ = k) = Mf{Si, . . .,Sn-i,m, - k). 

To show that /i has the CAPP, we should verify the APP for the conditional measures fix,Y G 
My\x given by 

fix,YiA)cxiiiAuX) iACY\X), 

where X CY C [m] and \Y \X\ = 2k (for some k). Fixing X, Y and letting A, B run over subsets 
of Z := y \ X, this amounts to 

T{X)T{Y) II{W^(c^)^(^) : CT ~ A U r ~ (Z \ A) U X} 
\A\=k 

>^^T(X)r(y) $^Wa)W^(^) :a~BUX,^~(Z\5)UX} (35) 

^ \B\=k-l 

Regard each of a, r, a, ^ in (35) as a bipartite graph on the vertex set [m] U i^T in the natural way. 
Then for each pair (cr, r) appearing in (35) the multiset union G = cr U r is a bipartite multigraph 
with exactly 2(|X| + A;) edges and 

f 2 liieX 
doii) = < 1 ifieY\X 
[ if i G [m] \ y 

(and similarly for pairs (a, /?)). We may thus rewrite (35) as 

H ^{VF(cT)VF(r) ■.aUT = G,a^\X\ + k} 
G 

> J2 13{^(«)^(/5) : a U ^ = G, a ~ |X| + fc - 1}, 

A; ^ 
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and it is enough to show that for each fixed G we have the corresponding inequality for the inner 
sum, i.e. 



^{VF(a)W^(T) : aUr = G, a ~ \X\+k} > ^{W^(a)M/(/3) : aU^ = G, a ~ \X\+k-l}. (36) 

This has the advantage that the weights no longer play a role, since for cr, /3 as in (36) , 

W{a)W{T) = W{a)W{^); 

so we will have (36) if we show 

where Ni = Ni{G) is the number of partitions G = 7 U 5 with 7 
Now let H = Til U 7^2 be the hypergraph with vertex set W 

Hi = {Hj : j G K}, 

where Hj = {e £ W : j £ e}, and 

^2 = {{e, /} : e 7^ /, e and / have the same end in [m]}. 
Define a : N by 

a„. = max{Sj,dG{j) - Tj} 

(recall Sj,Tj were defined following (34)). We are then in the situation of Lemma 10: a partition 
W = WiUW2 with l^il = \X\+i (and IVF2I = \Y\ - i) as in the lemma (i.e. with {Wi, W2) in 
place of (X, Y)) is the same thing as a partition G = 7 U 5 with 7 ~ \X\ + i (and 5 ~ |y| — i), and 
the t in the lemma is equal to the present k\ so we have (37). 



(37) 

^ \X\ +i. 
= EiG), 



Remark. Log-concavity results being of some interest, we mention one appealing specialization 
of (15). (See e.g. [16] or [2] for much more on log-concavity in combinatorial settings.) For a 
bipartite graph G = {V \J K,E), define a G-map to be a function f : A ^ K with A C V and 
{v, f{v)) G E for all v E A. Given l,u ^ (with Ij < Uj), call a G-map valid if \f~^{j)\ G [Ij, uj] 
for all j G K. Let Sk = Sk{G, I, u) be the number of valid G-maps f : A ^ K with \A\ = k. 

Theorem 12 For any G, Z, u the sequence (sq, • • ■ , s\v\) ultra-log-concave. 

In the special case Z = 0, it = 1, Sfc becomes $fc = $fc(G), the number of matchings of size k in G. 
Heilman and Lieb [8, 9] and Kunz [12] (see also [13, Chapter 8]) proved that for any (not necessarily 
bipartite) graph G, the matching generating polynomial 

V 

p{x) = ^kX^ 

(where, as usual, u = maxjfc : <I>fc > 0} is the matching number of G) has all real (negative) roots. 
This implies, by Newton's inequalities (e.g. [7, Theorem 51]), that 

($0, . . . , ^.) is ULC, (38) 
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which, if u < \V\, is somewhat stronger than this case of Theorem 12. In contrast, for general I 
and u the polynomial 

\v\ 

k=0 

need not have all real roots. (For example, let V = {y, v, w}, K = {!}, E{G) = {{y, 1}, {u, 1}, {w, 1}}, 

h = l,tii = 3.) 

Actually Theorem 11 can be used to show that for any G, (<I>o, ■ ■ ■ ■, ^r) is ULC (where, as usual, 
r = t{G) is the vertex cover number), which in particular recovers (38) when G is bipartite. As this 
doesn't use Lemma 8, we won't go into it here. It would be very interesting to see a combinatorial 
proof of (38) for general G. 

Before closing this section we point out one further consequence of Lemma 8, which seems to 
us interesting for its own sake. With notation as in the above proof of (15), set f{A) = '^{W{a) : 
a G n Q} {AQ [m]). We assert that / satisfies the negative lattice condition: 

f{Ay^B)f{A^^B)<f{A)f{B) ^A,B<Z[m\. (39) 

While we don't see how to get (14) (or (15)) from this in general, it's not hard to see that it does 
imply (14) in case the (7(i)'s are i.i.d., so gives yet another proof Theorem 2. We omit the details. 

Proof of (39) . We may rewrite the inequality as 

^ Y,{W{<t)W{t) :(7~AUS,r-^nB}<^ ^{W{a)W{P) : a ~ A, /3 ~ 5}. (40) 

As before we regard cr, r, a, /3 in (40) as bipartite graphs on [m] U K. For each pair (a, r) appearing 
in (40), the (multiset) union G = cr U r is a bipartite multigraph with 



2 \ii<^ A<r\B 
1 if i G AAB 
otherwise 



(and similarly for pairs (a, /3)), and it's enough to show that, for each such G, (40) still holds if we 
restrict to pairs (a, r) and (a, /3) with 

aUr = aU^ = G. (41) 

Again the weights {W{(t) etc.) cancel and it's enough to show 

N{A\JB,Ar\B) <N{A,B), (42) 

where, for C,D C [m], N{C, D) = Ng{C, D) is the number of partitions E{G) = 7 U 5 with 7 ~ G 
and S D. 

Notice now that we are really counting partitions a L) f and a U /3 of the edges of G' := 
G[{A n B) U K], since for any cr, . . . ,/3 (as in (40)) satisfying (41), any edge of G with an end in 
A\B (resp. B\A) must belong to a n a (resp. a H P). 

For j E K and G C [m] write dc{j) for the number of edges of G joining j to C. In terms of 
a,. . . ,P the requirement that a,...,P satisfy (34) becomes the condition that for each j G K, 

Sj - dAABij) < < Tj - dAABU); Sj < \f-\j)\ < Tj; 

Sj - dA\B{j) < < Tj - dA\B{j) and Sj - dB\A{j) < < Tj - dB\Aij)- ^ ' 
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Now let H be the multigraph on vertex set E{G') with edge set {ex : x G A n B}, where ex joins 
the two edges of G' containing x. We may identify a partition E{G') = 7 U (5 with the orientation 
of H gotten by directing Cx from a to 6 whenever a € 7 and b G 6, where a, b are the edges on x in 
G' . The orientations corresponding to pairs ((5",f) as in (43) are then those satisfying 

Sj - dAAsU) < d+{j) < Tj - dAAsU) and Sj < d-{j) < Tj \/j G K 

while those corresponding to pairs (d, 13) are those with 

Sj - dA\B{j) < d+ij) < Tj - dA\B{j) and Sj - dB\A{j) < d-{j) < Tj - dB\Aij) Vi G K. 

That the number of orientations of the first type is at most the number of the second type is then 
an instance of Lemma 8. 



5 Final remarks 

The most interesting question left open by the present work is whether Theorem 1 (even without 
thresholds) extends to nonidentical balls; that is, 

Question 13 Are generalized urn measures (or generalized threshold or interval urn measures) 
CNA? 

As mentioned in the introduction, Dubhashi and Ranjan [5] showed NA for the ^j^'s defined in (3), 
which immediately gives NA for generalized threshold urn measures. That the weaker (than CNA) 
CNC, at least, docs hold for generalized threshold (or, more generally, "interval") urn measures is 
a special case of the following result; this is a somewhat more general version of Corollary 34 of [5], 
which implies CNC for generalized threshold urn measures. We again take Pr to be the measure 
on [n]^"^^ corresponding to some 7, and, for A C 2["^1 and a,b E N""^, set p{A,a,b) = Fr{a~^{n) G 
A\Bje[aj,bj]yje[n-l]). 

Theorem 14 For any increasing A, p{A,a,b) is decreasing in (a, 6). 

The proof is more or less the same as that of Theorem 4(a), so will not be given here; see [14, 
Theorem 1.23]. (The proof of Corollary 34 in [5] is not quite correct, since it depends on the 
incorrect Proposition 24.) 

Thus one reason to be interested in whether generalized urn measures are CNA is that a negative 
answer would provide a counterexample to an important conjecture of Pemantle [15] stating that 
CNC implies CNA. (He also conjectures that the Rayleigh property NC+ implies NA+.) Pursuing 
this a little further, say fi G M{{0, 1})" is R+ if every W o ^ with Wi G {0} U [l,oo) Vi is NC. 
An easy simulation shows that CNC for the class of generalized urn measures is the same as 
for this class. (Note this is without thresholds; it's easy to see that i?"*" need not hold for (even 
ordinary) threshold urn measures.) So failure of CNA here would in fact disprove 

Conjecture 15 i?"*" implies CNA, 

a weakening of the first conjecture of Pemantle above. 

At this writing we can (e.g.) give a positive answer to Question 13 when each ball chooses 
from just two urns; this is of course quite special, but seems of some interest since it corresponds 



16 



to in- and out-degree statistics for a random orientation of a graph (where edges are oriented 
independently, but the two orientations of an edge may have different probabiHties). Even this 
special case seems to require an interesting argument, but we will not give this here as the paper 
seems long enough without it. 

As far as we can see, even the following very general statement could be true. 

Question 16 Suppose TqUTiU ■ ■ ■ UTg is a partition of [m] x [n], and a^, fer € N for r = 1, . . . , s. 
Is it true that the ^ij 's in (3) are NA given 

{C{Tr) € [ar,br] Vr G [s]} 

(where aT) = j:ii,j)eT^ij)? 

This would be a considerable strengthening of CNA for generalized threshold urn measures. 

Let us also just mention one possible approach to Question 13. Recall that for A4{{0, 1}"), 

fi stochastically dominates v (written ji v) \i IJ-iA) > I'iA) for each increasing A Q {0, 1}", and 
that has the normalized matching property ii, with X chosen according to fi and = //(-j^ = k) 
is stochastically increasing in k (meaning, of course, that /x(-|^ = k) h m(-|^ = I) whenever k > I). 
It is not too hard to show (this is somewhat like the derivation of CNA from CNC in [6]) that CNA 
for generalized interval urn measures would follow from a positive answer to 

Question 17 Is it true that for any a G [n]'"*' with law given by (6) and Q as in (7), the law of 
a~^{K) given Q has the normalized matching property? 

Finally we turn to the conjectures of Farr (unpublished circa 2004; see [18]) and Welsh [18] 
mentioned at the end of Section 1. To put these in our framework, we add an urn A and assume 

Pr((7(i) = j) = p G [m\,j G [n]. 

(So Pr((7(i) = A) = 1 - np.) Let I C 2H be decreasing and set Aj = {cr~^{j) G 1} and 
Aj = C\{Aj : j G J}. Then Farr's conjecture (somewhat rephrased) is 

Conjecture 18 If G is a graph on [m] and Z is the collection of independent sets of G, then for 
any disjoint I,J,KC. [n] , Ai i Aj given Ak ■ 

It's not clear why this should require that X be of the type described, and Welsh's conjecture was 
that the same conclusion holds for an arbitrary X. Here we sketch a counterexample to this stronger 
version. At present we don't sec how to extend to a counterexample to Conjecture 18, though we 
feel that this too is likely to be false. 

Example. Let n = 3 and p = 1/3 (so we don't need A). Let MU^UBUCbea partition of 
V := [m] with \M\ = s (large) and |^| = \B\ = \C\ = t = s + 3. Let Xi = 2^\^, 

X2 = {X CV :\XnM\< A\M\ and X meets at most two of A, B, C}, 

and X = Xi U22. Then, we assert, 

Pr(yl{3})Pr(yl{i,2,3}) > Pr(^{i,3})Pr(^{2,3}), 
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which contradicts Welsh's conjecture (with / = {1}, J = {2} and K = {3}). We omit the precise 
calculations; roughly, with a = (2/3)* and c = (3/2)^, we have (as t — >■ oo) 



Pr(^L) 



(c + 3)a if|L| = l 
if \L\ = 2 
if ILI = 3. 
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